Picture for Hao Bai

Hao Bai

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents

Add code
Jun 01, 2026
Viaarxiv icon

PRO-CUA: Process-Reward Optimization for Computer Use Agents

Add code
May 27, 2026
Viaarxiv icon

InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning

Add code
Jan 20, 2026
Viaarxiv icon

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Add code
Jan 07, 2026
Viaarxiv icon

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Add code
Oct 14, 2025
Viaarxiv icon

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Add code
Jun 09, 2025
Figure 1 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 2 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 3 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 4 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Viaarxiv icon

Improving Neuron-level Interpretability with White-box Language Models

Add code
Oct 21, 2024
Viaarxiv icon

NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations

Add code
Jul 18, 2024
Figure 1 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 2 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 3 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 4 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Viaarxiv icon

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Add code
Jun 14, 2024
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Figure 1 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 2 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 3 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 4 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Viaarxiv icon